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Abstract 

Natural disasters may have considerable impact on society as well as on (re) insurance 
industry. Max-stable processes are ideally suited for the modeling of the spatial ex¬ 
tent of such extreme events, but it is often assumed that there is no temporal 
dependence. Only a few papers have introduced spatio-temporal max-stable mod¬ 
els, extending the Smith, Schlather and Brown-Resnick spatial processes. These 
models suffer from two major drawbacks: time plays a similar role as space and 
the temporal dynamics is not explicit. In order to overcome these defects, we in¬ 
troduce spatio-temporal max-stable models where we partly decouple the influence 
of time and space in their spectral representations. We introduce both continuous 
and discrete-time versions. We then consider particular Markovian cases with a 
max-autoregressive representation and discuss their properties. Finally, we briefly 
propose an inference methodology which is tested through a simulation study. 

Key words: Extreme value theory; Spatio-temporal max-stable processes; Spectral 
separability; Temporal dependence. 


1 Introduction 


In the context of climate change, some extreme events tend to be more and more frequent; 
see e.g. SwissRe (2014). Meteorological and more generally environmental disasters 
have a considerable impact on society as well as the (re)insurance industry. Hence, the 
statistical modeling of extremes constitutes a crucial challenge. Extreme value theory 
(EVT) provides powerful statistical tools for this purpose. 

EVT can basically be divided into three different streams closely linked to each other: 
the univariate case, the multivariate case and the theory of max-stable processes. For an 


introduction to the univariate theory, see e.g. 

Coles ( 

2001 

) and for a detailed description, 

see e.g. 

Embrechts et ah 

(1997 

) or 

Beirlant et ah 

(200 

6). In the multivariate 

case, we refer 

to 

Resnick 

(1987 

), 

Beirlant et al. 

(2006 

) and 

de Haan and Ferreira 

(2007 

). Max-stable 


processes constitute an extension of EVT to the level of stochastic processes (de Haan 


1984, de Haan and Pickands 1986) and are very well suited for the modeling of spatial 


extremes. Indeed, under fairly general conditions, it can be shown that the distribution of 
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the random field of the suitably normalized temporal maxima at each point of the space 
is necessarily max-stable when the number of temporal observations tends to infinity. For 


a detailed overview of max-stable processes, we refer to de Haan and Ferreira (2007) 


In the literature about max-stable processes, measurements are often assumed to be 
independent in time and thus only the spatial structure is studied (see e.g. Padoan et ah 


2010). Nevertheless, the temporal dimension should be taken into account in a proper 


way. To the best of our knowledge, only a few papers focus on such a question. The ma¬ 
jority of the spatio-temporal models introduced is based on Schlather’s spectral represen¬ 


tation ( 

Penrose 

1992, 

Schlather 

2002 

which has given 

(Schlather, 

2002 

and Brown-Resnick 

Kabluchko et al. 


tation tells us that if (£/*)*> 1 generates a Poisson point process on (0, oo) with intensity 
u~ 2 du and ((F):(y)) ygR d)i>i are independent and identically distributed (iid) non-negative 
stationary stochastic processes such that E[17(y)] = 1 for each y G R d , then the process 
(V ~1 {b^T)(y)}) y e]Rd is stationary simple max-stable, where simple means that the mar¬ 
gins are standard Frechet. Here \J denotes the max-operator. In Davis et al. (2013a), 


Huser and Davison] (2014) and Buhl and Kliippelberg (2015), the idea underlying the con¬ 


struction of the spatio-temporal model is to divide the dimension d into the dimension 
d— 1 for the spatial component and the dimension 1 for the time. Davis et al. (2013a) in¬ 


troduce the Brown-Resnick model in space and time by taking a log-normal process for Y t 


while Buhl and Kliippelberg (2015) introduce an extension of this model to the anisotropic 


setting. Huser and Davison (2014) consider an extension of the Schlather model by using 


a truncated Gaussian process for the Y t and a random set that allows the process to be 
mixing in space as well as to exhibit a spatial propagation. Advantages of these models 
lie in the facts that the Schlather and Brown-Resnick models have been widely studied 
and that the large literature about spatio-temporal correlation functions for Gaussian 
processes can be used, allowing for a considerable diversity of spatio-temporal behav¬ 
ior. 


Davis et al (2013a) also introduce the spatio-temporal version of the Smith model 
(Smith[ 1990) that is based on de Haan’s spectral representation (see de Haan 1984). 


e de Haan, 

If ( Ui , Ci)i >i are the points of a Poisson point process on (0, oo) x R d with intensity 


u 


-2 


du x dc and if g y are measurable non-negative functions satisfying f Rd g y (c)dc = 1 
for each y G R d , then the process (Vj^i { U,g y (Cj)}) yg]R d is a simple max-stable process. 
However, they do not allow any interaction between the spatial components and the tem¬ 
poral one in the underlying covariance matrix. The previous spatio-temporal max-stable 
models suffer from some defects. First, they are all continuous-time processes whereas 
measurements in environmental science are often time-discrete. Second, time has no spe¬ 
cific role but is equivalent to an additional spatial dimension. Especially, the spatial and 
temporal distributions belong to a similar class of models. This constitutes a serious 
drawback since such a similarity is not supported by any physical argument. Third, the 
temporal dynamics is not explicit and hence difficult to identify and interpret. Finally, 
these models have in general no causal representation. 

The theory of linear ARMA processes has led to the max-autoregressive moving aver¬ 
age processes (MARMAQo, q)) introduced by Davis and Resnick (1989). The real-valued 
process (A"(t)) tg ^ follows the MARMA(p, q) model if it satisfies the recursion 

X(t) = max(0!A"(t — 1),..., 4> p X(t — p),Z(t), 9iZ(t — 1),..., 9 q Z(t — g)), t G Z, 

where (pi,9j > 0 for i = 1 , ... ,p and j = 1 , ... ,q and the max-stable random variables 
Z(t ) for t G Z are iid. It is a time series model which is max-stable in time. However, the 
spatial aspect is absent. An interesting approach to build spatio-temporal max-stable 
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processes could be inspired by the theory of linear processes in function spaces such 
as Hilbert and Banach spaces (see e.g. Bosq, 2000) and especially by the autoregressive 


Hilbertian model of order 1, ARH(l) (see e.g. Bosq, 2000, Hormann and Kokoszka, 2012). 
We say that a sequence (X(t))tez of mean zero functions in an Hilbert space H follows 
an ARH(l) process, if 


x{t) = v{x(t-i)) + z{t), te z, 


where ft is a bounded linear operator from H to H and (Z(t))tzz is a sequence of iid mean 
zero functions in H satisfying E(||Z(f)|| 2 ) < oo, where ||.|| denotes the norm induced 
by the scalar product on H. Various types of linear transformations can be applied 
though the most commonly used is the local average operator which involves a kernel. A 
transposition of this model to the context of the maximum instead of the sum could for 
instance be written as 


X(t,x) = max(V(X(t - 1, -))(x),Z(t,x )), (t,x) e (Z,R d x ), (1) 


where ((Z(t, x)) xeR d-i) te % is a sequence of iid spatial max-stable processes and T is an 
operator from the space of continuous functions on R d_1 to itself such that, if (X(t — 
l,x)) xgR d-i is max-stable in space, then (\P(X(£ — 1, -))(x)) xe ]Rd-i is also max-stable in 
space. Such an operator could for instance be a “moving-maxima” operator 


vk(X(t-l,-))(x)= V {K(*,x)X(t- l,s)}, x G R d_1 , 

seR d_1 


where K is a kernel (see Meinguet (2012) for a similar idea), or an operator combining a 
translation in space with a scaling transformation 

^f(X(t — 1, -))(x) = aX(t — 1, x — r), x G R d_1 , 


where a G (0,1) and r G R d_1 . 

In this paper, we propose a class of models where we partly decouple the influence 
of time and space, but such that time influences space through a bijective operator on 
space. We present both continuous-time and discrete-time versions. A first advantage of 
this class of models lies in their flexibility since they allow the marginal distribution in 
time to belong to a different class than the stationary distribution in space. Actually, 
these margins can be chosen in function of the application. Due to the spatial operator 
mentioned above, our models are able to account for physical processes such as propa¬ 
gations/contagions/diffusions. Furthermore, the estimation procedure can be simplified 
since the purely spatial parameters can be estimated independently of the purely temporal 
ones. 

Then, we study some particular sub-classes of our general class of models, where the 
function related to time in the spectral representation is the exponential density (in the 
continuous-time case) or takes as values the probabilities of a geometric random variable 
(in the discrete-time case). In this context, our models become Markovian and have a 
max-autoregressive representation. This makes the dynamics of these models explicit and 
easy to interpret physically. 

The remaining of the paper is organized as follows. Section 2 presents our class 
of spectrally separable space-time max-stable models. In Section 3, we focus on the 
particular Markovian cases where the space is R 2 and the unit sphere in R 3 , respectively. 
Section 4 briefly presents an estimation procedure as well as an application of the latter 
on simulated data. Some concluding remarks are given in Section 5. 
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2 A new class of space-time max-stable models 


The time index t and space index x will belong respectively to the sets X and X. The 
models we introduce will be either continuous-time (X = R) or discrete-time (X = Z). 
In the following, we denote by S the Lebesgue measure on R in the case X = R and the 
counting measure d{ z } when X = Z, where d stands for the Dirac measure. 

To define discrete-time models, we need to introduce the notion of homogeneous 
Poisson point process on Z. Let (A/.) fcgZ be iid Poisson(l). For AcZ,JV (A) = Ylk&A Nk 
defines an homogeneous Poisson point process on Z with constant intensity equal to one 
(see Appendix [A]) . Note that N is not a simple point process. 

Space-time simple max-stable processes on X x X allow for a spectral representation 
of the following form (see e.g. de Haan (1984)): 


X(t,x) = \fiUiVwiWi)}, (t,x) GX x X , (2) 

1=1 


where (C/j,Wj)j>i are the points of a Poisson point process on (0, oo) x E with inten¬ 
sity u~ 2 du x p(dw) for some Polish measure space (X 1 , X, /i) and the functions Vq iX ) : 
E —» (0, oo) are measurable such that f E V(t lX )(w)/j,(dw) = 1 for each (t,x) GX x X. A 
class of space-time max-stable models avoiding the previously mentioned shortcomings is 
introduced below. 


Definition 1 (Space-time max-stable models with spectral separability). The class of 
space-time max-stable models with spectral separability is defined inserting the following 
spectral decomposition in (|2]).- 

W^i) = vabw ( 3 ) 

where: 


• (Ui, Bi, Cf) j>i are the points of a Poisson point process on (0, oo) x Ei x E 2 with 
intensity u~ 2 du x pfidb) x /i 2 (dc) for some Polish measure spaces (X 1; X i, pi) and 
(X 2 , X 2 , /i 2 ); 


• the operators R(t,b) ore bijective from X to X for each (t, b) Glx L; 

• the functions V) : E\ —> (0,oo) are measurable such that f E Vt{b)p\{db) = 1 
/or eac/i t G X and fde functions V x : X 2 —>■ (0, oo) are measurable such that 
f E V r x (c)p 2 (dc) = 1 for each x G T. 

We emphasize that the models belonging to this class are max-stable in space and 
time, since 


V(t,x)(w)fj,(dw) = 


' E\ X i?2 


V t (b)V R(tb)X (c)ni(db)iJ, 2 (dc) 


'Ei 


= I V t (b) ( / Vr ( ti6)X (c)/J 2 (dc) ) /xi(d6) 


£^2 


= / VtWmidb) = l, 
J Ex 


but of course also in space and in time only. A spectral decomposition in space e.g. is 
easily derived since, for a fixed t, (UiVfiBf), Ci)i>i defines a Poisson point process on 
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(0, oo) x E 2 with intensity vT 2 du x /i 2 ( dc ) and f E> Vr ( 4 6)X (c)// 2 (dc) = 1 for each x EX 
and b E Ei. 

The crucial point in the previous definition lies in the fact that we have decoupled the 
spectral functions with respect to time and the spectral functions with respect to space 
given time. This allows one to deal with the temporal and the spatial aspects separately. 
Moreover, the latter depends on time through a bijective transformation which typically 
may account for an underlying physical process. 

The finite dimensional distributions of A" in ([3]) are given, for M E HNT\{0}, t x ,..., £m £ 
X, xi,..., x M E X and z x ,..., z M > 0, by 


log(P(X(f 1 ,x 1 ) < z l7 ... ,X(t M ,x. M ) < z M )) 
M 'V,Jb)V R ^, Xm (c) 


V 

>E 1 xE 2 m=1 


jii(db)ii 2 (dc). 


(4) 


We now provide some examples of sub-classes of the general class of space-time max- 
stable processes given in Definition [l] 

i) Models of type 1: de Haan’s representation with X = R 2 

We take E\ — X with /q = S and E 2 = X = R 2 with /i 2 = A 2 , where A 2 is the 
Lebesgue measure on R 2 . Let g be a probability density function (case X = R) or a 
discrete probability distribution (case X = Z), and / be a probability density function 
on R 2 . We then assume that 


Vt{b)=g(t-b), 14(c) = /(x-c) 

and that the operators R(t,b) are translations: for all t,b E X and x E R 2 , R(t,b ) x = 
x — (t — b)r , where r G R 2 . 

The class of moving maxima max-stable processes with general spectral representation 
(|2| assumes the existence of a probability density function h on X x R 2 such that 

V(t,x) (iu) = h (t - 6, x - c). 

The density function h can always be decomposed as follows: 

h(t,x) =g(t) hi (x| t), 

where h\ (x|f) is the conditional probability density function on R 2 given t. For models 
of type 1, we have implicitly assumed that this density function satisfies the equality 
hi (x|t) = / (x — fr). 

Models of this type are interesting in practice since, as we will see in the next section, 
they have a max-autoregressive representation for a well chosen function g. The latter 
makes the dynamics explicit. Moreover, the translation operator allows to model physical 
processes such as propagation and diffusion. 

We denote by S 2 = {x E R 3 : ||x|| = 1}, the unit sphere in R 3 . 

ii) Models of type 2: de Haan’s representation with X = S 2 

We choose E\ = X with /q = S and E 2 = X = S 2 with /i 2 = A$ 2 , where A«j 2 is the 
Lebesgue measure on S 2 . Let g be a probability density function (case X = R) or a 
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discrete probability distribution (case X = Z) and / be the von Mises-Fisher probability 
density function on S 2 with parameters /i € S 2 and k > 0: 

/( x ; Ab k) = --—— exp (/c/i'x) , x G S 2 . (5) 

47 T smh k 

The parameters /i and k are called the mean direction and concentration parameter, 
respectively. The greater the value of k, the higher the concentration of the distribution 
around the mean direction /i. The distribution is uniform on the sphere for k = 0 and 
unimodal for k > 0. We assume that 

Vt(b) = g(t-b), 14 (c) = f(x;c,K) 

and that, for u = ( u x ,u y ,u z )' £ S 2 , R(t,b) = Re(t-b), u, where Rg u is the rotation matrix 
of angle 9 around an axis in the direction of u. We have that 

Re,u = cos 0/ 3 + sin 9 [u] x + (1 — cos 6 ) )uu / , 

where J 3 is the identity matrix of R 3 and [u] x the cross product matrix of u, defined by 

/ 0 -u z u y \ 

[u] x = \ U z 0 -u x . 

V ^y 0 ) 

To the best of our knowledge, the resulting models are the first max-stable mod¬ 
els on a sphere. Such models can of course be relevant in practice due to the natural 
spherical shape of planets and stars. Moreover, as before, this type of model has a 
max-autoregressive representation for an appropriate function g. 

iii) Models of type 3: Schlather’s representation with X = R 2 

For d £ HNT\{0}, let Cg = C (R d ,R + \{0}) be the space of continuous functions from 
R d to R + \{0}. For this sub-class of models, (Ei,Si,ni) and (E 2 , £ 2 , ^ 2 ) are probability 
spaces with E { = C\, E 2 = C 2 and /j 1 and /i 2 are probability measures on E\ and E 2 , 
respectively. The function V t (respectively V x ) is dehned as the natural projection from 
Ci (respectively C 2 ) to R + such that 

Vt(b) = b(t) and 14(c) = c(x), 

with the conditions that E[ 6 j(t)] = 1 and E[c*(x)] = 1. Note that for notational consis¬ 
tency, we use small letters for the stochastic processes b and c. The spectral processes 
Cj are assumed to be either stationary and in this case R(t,b ) x = x — fr where r G R 2 , 
or to be isotropic and in this case i?(t,b)X = Wx where A is an orthogonal matrix ( R{t,b ) 
corresponds to a rotation). 

iv) Models of type 4: Mixed representation with X = R 2 

We choose E\ — X, Hi — 8, X — R 2 , E 2 — C 2 . Let g be a probability density function 
(case X = R) or a discrete probability distribution (case X = Z) and /i 2 a probability 
measure on C 2 . We take 

V t (b) = git- b) and 14(c) = c(x). 

As in the previous case, 14 is the natural projection from C 2 to R + . Once again, note that 
we use a small letter for the stochastic process c. The spectral processes c t are assumed 
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to be stationary and R(t,b ) x = x — (t — 6)r, where r E R 2 . As for models of types 1 
and 2 , the processes of this type can be written under a max-autoregressive form for a 
well-chosen function g. 

We now focus on the stationary distributions in space i.e. when we consider a fixed 
time t and look at the spatial dimension. For a fixed t G Z, we define the process 
(X t (x)) xeX = (X(t, x)) xgA ,. For two processes, = denotes equality in distribution for any 
finite dimensional vectors of the two processes. 

Theorem 1 (Stationary distributions in space). For a fixed t G I, assume that for each 
M G 1N\{0} ? b G Ei, x 1; ..., x M G X and Z\, ... , Zu > 0, we have that 



Then we have that 

OO 

X t (x) = \J {U t \ 4(a)} , x G R 2 , 

i =1 

where ([/,, (7j)j>i are the points of a Poisson point process on (0, oo) x E 2 with intensity 
u~ 2 du x p- 2 (dc). Moreover, Assumption © is satisfied for models of types 1, 2, 3 and f. 


We see in Theorem [T| that the spectral separability and the use of specific operators 
R make the spectral function V x (with its associated point process (Cf)i> i) the function 
which appears in the spatial spectral representation. The stationary distribution in space 
only depends on the spatial parameters of the model. This property is interesting from a 
statistical point of view since any estimation procedure can be simplified by considering 
in a first step the spatial parameters only without taking into account the temporal ones 
(see Section 4). Note that the idea of using a transformation of space in (Jb]) can also be 
found in Strokorb et al. (2015), in a different context. 


We now look at the marginal distributions in time i.e. when we consider a fixed site 
x G X. As previously, we define the process (X x (t)) teX = (X(t,x)) teX . 


Theorem 2 (Marginal distributions in time). For a fixed x G X, assume that there exist 
two operators S and G from X to X such that 


R(t,b)S {t) X d*(b)X. 


(7) 


Then we have that 

OO 

As (0 xW = \J {UiVtiBi}} , te z, 

i =1 

where {Ui,Bf)i>i are the points of a Poisson point process on (0, oo) x E\ with intensity 
u~ 2 du x pLi ( db ). Assumption (JT]) is satisfied for models of types 1 and f with S^x = x+tr, 
for models of type 2 with SmX = R_g t u x and for models of type 3 with S( t )X = x + tr or 
S( t) x = A~ f x, where r G R 2 and A is an orthogonal matrix. 

Contrary to Theorem [lj it is not possible to say that the marginal distributions in 
time are those given by the temporal spectral representation with the spectral function 
V t and its associated point process In order to obtain such a representation, it 

is necessary to apply a time transformation Sm on x. As a consequence, it is difficult to 
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estimate the temporal parameters separately since this transformation is not necessarily 
known in practice. The transformation indeed depends on the type of model and the 
parameters we want to estimate. Note that if Rt.b does not depend on t (for instance 
the translation with r = 0), i.e. if space and time are fully separated in the spectral 
representation, then S(t) is equal to the identity. 

3 Markovian cases 

In this section, in the case X = R, g is the density of a standard exponential random 
variable whereas in the case X = Z , g corresponds to the probability weights of a geometric 
random variable: 



where v > 0 and </> G (0,1). We first consider models of type 1 and type 4 and then models 
of type 2. The choice of the function g in ([ 8 ]) makes these spatio-temporal max-stable 
models Markovian. 


3.1 Markovian models of type 1 and type 4 

Recall that we assume the transformations Ro,b) to be translations: = x—(f— 6 )r, 

where r € R 2 . The parameter r gives a preferred direction of propagation of the process. 
In this context, we obtain 


X(t,x) 


V»>i {Uiis exp(—v(t — Bi))I {t - Bi >o } V x -(t-Bi)T(Ci)} if X = R 
Vi>! {^>(1 - if X = Z • W 


Note that for X = R, the function g has been introduced by |Dombry and Eyi-Minko 


(2014), under the form g(t ) = — log(a)a t I{t>o} for a G (0,1), in order to build the 
continuous-time version of the real-valued max-AR(l) process. 

The following result shows in particular that the process X defined in ([9]) satisfies a 
stochastic recurrence equation. Let us denote by a the constant exp(— v) if X = R and 
the constant f if X = Z. 


Theorem 3. i) For all t,s G X such that s < t, we have that 


X(t , x) = max(a s X(£ — s, x — sr), (1 — a s )Z[t , x)), 


( 10 ) 


where the process (Z(t, x )) xgR 2 is independent of(X(t — ■s,x )) xgR 2 and 


x gr 2 , 

i —1 


( 11 ) 


with ( Ui , Ci)i> i the points of a Poisson point process on (0, oo) x E 2 of intensity u 2 du x 
p 2 (dc). 

ii) Let X = Z and ((Z(t,x)) xgR i) te z be a family of iid max-stable processes with 
spectral representation (11), we have that 


A "(^ x ) = V {a 3 ~ a)Z{t - j,x- jr)} 

3=0 


(12) 
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From Theorem[3j it can be seen that our model X extends the real-valued MARMA(1, 0) 
process of Davis and Resnick (1989) to the spatial setting. The parameter a measures the 


influence of the past, whereas the parameter r represents some kind of specific direction 
of propagation (contagion) in space. For the sake of ease of interpretation, consider the 
case where Z = Z and s — 1. The value at location x and time t is either related to 
the value at location x — r at time t — 1 or to the value of another process (the innova¬ 
tion), Z, that characterizes a new event happening at location x. If the value at location 
x — r and a are large, it is likely that there will be a propagation from location x — r 
to location x, i.e. contagion of the extremes, with an attenuation effect. Contrary to the 
existing spatio-temporal max-stable models, the dynamics is described by an equation 
that can be physically interpreted. Note that the translation by the vector —r is one of 
the easiest transformations that allows to broaden the direct extension of the real-valued 
MARMA(1, 0) model to a spatial setting. 

Moreover, the combination of Theorems [l] and [3] shows that the stationary spatial 
distribution of the Markov process/chain ((X(t, x)) xg R 2 ) te;r is the same as that of Z. It 
is important to remark that C 2 (its state space) equipped e.g. with the topology induced 
by the distance d(/i, / 2 ) = sup(min(|/ 1 (x) —/ 2 (x)|, 1) : x 6 R 2 ), for two functions f\ and 
/ 2 , is not locally compact. Therefore, the theory developed e.g. in Meyn and Tweedie 


(2009) cannot be used to derive additional properties. 


We now consider the special case 


X(t,x) = max(aX(£ — l,x — r), (1 — a)Z(t,x)), t E Z, 


(13) 


where a E (0,1), r E R 2 are both fixed and ((Z(£, x)) xgK 2 ) tgZ is a sequence of iid spatial 


max-stable processes with spectral representation (11). The general distribution function 
of this process is given in the next proposition. 

Proposition 1. For M E 1N\{0}, ti < ■ ■ ■ < tM G Z, xi, ..., xm E R 2 and z i,..., Zm > 
0 , we have that 


-log (P(AT(fi,xi) < z l: ...,X(t M ,x M ) < z M )) 

_ v (~ Z2 Zm 

— Vxi,X2-(t2-ti)r,...,x M -(t M -ti)r I ^ 1 ) 


* 2—*1 ’ ’ 


M—l 


A ^ 1 (1 a )V Xm;Xm+ i—( W 1 — t m )x,...,x M -(t M -t m )r (zm, 


m =2 


Zm +1 

Qtm+l tm 


Zm 


tn 


a 


a 


tM—tM-l 


Zm 

where V is the exponent function characterizing the spatial distribution, defined by 

r M 

(z 1 ,z 2 ,...,z M ) = I V 


(14) 


V 


Xl,X2,...,X M 


'R 2 m= 1 


Vxjc) 


/r 2 (dc). 


By using the approach developed by Bienvenue and Robert (2014), the right-hand 
term of (14) can easily be computed provided that the distribution of (Vk m (c)) m=] M 


with c ~ fi 2 is absolutely continuous with respect to the Lebesgue measure. This is the 
case for example for the spatial Schlather and Brown-Resnick processes. 

It is easily shown that the models of types 1 and 4 are stationary in space and time. In 
order to measure the spatio-temporal dependence, we propose extensions to the spatio- 
temporal setting of quantities that have been introduced in the spatial context. The 
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first one is the spatio-temporal extremal coefficient function, stemming from the spatial 
version by Schlather and Tawn (2003), which is defined for all ti,t 2 G Z and xi,X 2 € R 2 

by 

6(t 2 - ti ,x 2 - x x ) 


P(X(ti,xi) < u,X(t 2 ,x. 2 ) <u) — exp 


u 


u > 0. 


The second one is the spatio-temporal <L i-madogram, coming from the spatial version 
introduced by Cooley et ah (2006), where $1 is the standard Frechet probability distri¬ 
bution function. It is defined by 

^(4 ~ h ,x 2 - xi) = ^E[|$i(X(t 2 ,x 2 )) - <hi(X(ti, Xl ))|]. 


Proposition 2. In the case of (13), for ti,t 2 G Z and xi,x 2 G R 2 , the spatio-temporal 
extremal coefficient is given by 


0(t 2 - ti, x 2 - xi) = V Xl , X2 _(t 2 _ tl ) T (l, a* 1 t2 ) + 1 - 


,t2~ tl 


(15) 


and the spatio-temporal <f> i-madogram of X is given by 


1 0{t 2 - t 1; x 2 — xi) — 1 


. .1 Uyi2 — -*-2 — -X-l ) — ± ± l 

I't, <2 1 1 ,X 2 x, 2 0(t 2 — ti,x 2 — Xi) + 1 2 V xllX2 _( fa _i l)T (l.o'i-fe) +2 - 


(16) 


Similarly, it would also be possible to extend the A-madogram, introduced by Naveau! 


et al. (2009), to the spatio-temporal setting. 


Proposition [2] shows that we do not fully separate space and time in the extremal 
dependence measure given by the extremal coefficient, even if r = 0. On the other hand, 
in the latter case, space and time are entirely separated in the spectral representation: 
V t only depends on time and Vr^x = 14 only depends on space. 

Furthermore, denoting l = t 2 — 1 1 and h = x 2 — x 1; we have lim^ oo 0(/,h) = 2, 


showing asymptotic time-independence. Moreover, from Kabluchko and Schlather (2010), 
we deduce that, for a fixed x, the process (X x (t)) tex is strongly mixing in time. If t\ = t 2 , 
lim||h|i-xx> h) = 2 if and only if X is strongly mixing in space. 

Before showing some simulations, let us define the spatial Smith and Schlather mod¬ 
els. Let (Ui, Ci)i >i be the points of a Poisson point process on (0, oo) x R 2 with in¬ 
tensity u~ 2 du x A 2 (dc) and let h 5 denote the bivariate Gaussian density with mean 0 


and covariance matrix E. Then the spatial Smith model (Smith, 1990) is defined as 


Z(x) = — Q)}, for x G R 2 . Let (C/j)j> 1 be the points of a Poisson point 

process on (0, 00 ) with intensity u~ 2 du and Y l7 Y 2} ... independent replications of the 
stochastic process Y(x) = v / 2vr^(x), for x G R 2 , where £ is a stationary standard Gaus¬ 


sian process with correlation function p(.). Then the spatial Schlather process (Schlather 
2002) is defined as Z(x) = V^il UiY^x.)}, for x G R 2 . 


In the left panel of Figure [lj we show the evolution of the process (13) when Z is a 
spatial Smith process with covariance matrix 


E = 


1 O' 

0 l. 1 ’ 


a = 0.7 and r = (—1,-1)' (translation to the bottom left). In the right panel of 
Figure [lj we show the evolution of the process (13) when Z is a spatial Schlather process 
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exp 


with correlation function of type powered exponential, defined, for all h > 0 by p(h) = 
c 2 1 

for Ci > 0 and 0 < C 2 < 2, where Ci and C 2 are the range and the 
smoothing parameters, respectively. We take c\ = 3, c 2 = 1 and, as previously, a = 0.7 
and r = (—1, —l)'. Note that the process (13) with Z being the spatial Smith and the 
spatial Schlather process corresponds to models of types 1 and 4, respectively. 


m 

as 

c 

T3 

O 

O 


as 

C/D 


o 



0 2 4 6 8 100 2 4 6 8 10 

First coordinate 



0 2 4 6 8 100 2 4 6 8 10 

First coordinate 


Figure 1: Simulation of the process ( Jl3| ) for Z being a spatial Smith process (left panel) 
and a spatial Schlather process (right panel). We depict the logarithm of the value of 
the process in space. In both cases, the evolution over 4 periods is represented (top left: 
t — 1, top right: t — 2, bottom left: t = 3, bottom right: t — 4). 


In both cases, we observe a translation of the main spatial structures (the “storms” in 
the case of the spatial Smith model) to the bottom left, hence highlighting the usefulness 
of models like (13) for phenomena that propagate in space. 


3.2 Markovian models of type 2 

Let Cg 2 = C(S 2 ,R + \{0}) be the space of continuous functions from S 2 to R + \{0}. For 
h G C$ 2 , let Ilh-H^ = sup xeS 2 h(x) and d(hi, h 2 ) = \\h\ — h 2 \\ 00 ', then C $2 is a Polish space 


(see e.g. Kechris 1995, Theorem 4.19). 

By using the same arguments as in Theorem [3j we deduce that the models of type 2 
satisfy the following stochastic recurrence equation: 


X(t, x) = ma x(aX(t — 1, Re i, u x), (1 — a)Z(t, x)), (17) 

where the process (Z(t,x )) xe $2 is independent of (X(t — l,x)) xe s 2 and is such that 

OO 

z (t,x) = \/ {Uif(x-,Hi,K)}, XGS 2 , (18) 

i =1 

with (U t , /Xj)j>! the points of a Poisson point process on (0, 00 ) x S 2 with intensity u~ 2 du x 
d\$ 2 . Therefore, ((W(t,x)) xeS z) tgZ is a Markov chain with state space C$ 2 . Its transition 
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kernel is denoted by V. By using non standard results for Markov chains on Polish 


spaces (Hairer (2010)), it is possible to show that the transition kernel converges towards 


a unique invariant measure at an exponential rate. Note that the state space considered 


in Section |3.1[ C 2 , is not separable and hence is not a Polish space, yielding that the 
results of Hairer (2010) cannot be used in that case. 

Let L be a function from C $2 to [0, 00 ) and let us introduce a weighted supremum 
norm on the space of functions from C $2 to R in the following way. For ip : C $2 1 —)■ R, we 
define 


= sup 


\(p(h)\ 


1L £&i + L(hy 

Finally, for a distribution 7 r on C $2 and <p : C $2 1 —» R, let us denote 7r(<p) = f c ^ <p(h)d7r(h). 


By Theorem 3.6 in Hairer (2010), we deduce the following result about the geometric 
ergodicity of the Markov chain ((X(t, x)) xG $ 2 ) tgZ . 

Theorem 4 (Geometric ergodicity). There exists a unique invariant measure n for the 
Markov chain ((X(t, x)) xg $ 2 ) tgZ . Let, for h G C$ 2 , L{h) = Hh 7 ^ with 0 < 7 < 1. There 
exist constants C > 0 and p G (0,1) such that 

\\V n ip - vr((^)|| L < Cp n || ip - n{ip)\\ L 

holds for every measurable function ip : C$ 2 1 —> R such that ||<p|| L < 00 . 


4 Estimation on simulated data 


In this section, we briefly discuss statistical inference for the process (13). We denote by 6 


the vector gathering the parameters to be estimated. One possible method of estimation 


consists in using the pairwise likelihood (see e.g. Davis et al. 2013b), which requires the 
knowledge of the bivariate density function for each t\,t 2 G R and Xi,X 2 G R 2 . The 
latter is given in the following proposition. 

Proposition 3. For ti,t 2 G R, x 1; x 2 G R 2 and Z\,Z 2 > 0, the bivariate density of the 


process (13) is given by 


f(t i,xi),(t 2 ,x 2 ) { z li z 2 j 0) 


exp I V Xl , X2 _p 2 _ti )T ( Z\, 


z 2 


7*2 ~tl 


a 


t2~tl 


z 2 


( ^ 1 Vxi,Xa_(t2 “ tl)T ( Zu X ( 0* 2 Vxi,Xa - (t2 " tl)T ( Zl ' 

d d ,, ( z 2 \ 

d Z \ ^ XllXa " (ta -* l)T \ Zl, a^J 


z 2 


* 2—*1 


+ 


1 — a 


t2~tl 


7 2 

^2 


(19) 


Regarding the process Z appearing in (13), we only consider the case of the spatial 


Smith model. Its covariance matrix is denoted 

£ _ | 0*11 CTl2 


7 X 12 cr 2 2 / 

and 6 is now given by (an, 072 , U 22 , a, T ')'■ The bivariate density function is given below. 
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Corollary 1. We denote by wi = ^ log ( a t 2 - 2 t 1 , i ) and v\ 


h\ — w\, where 


hi = \J (x 2 - (t 2 - t\)r - xi)'E 1 (x 2 - (t 2 - h )t - xi). 

Let f x ,f 2 G R, x 1 ; x 2 G R 2 and zi,z 2 > 0. The bivariate density function of the process 
(13) with Z being the spatial Smith process is given by 


f(t i,xi),(t 2 ,x 2 ) ( z li z 2i 0) 


= exp 


x 






t2~tl 


$(wi) 

Zi 

$(wi) : <f>(wi) a t2_tl 0(v i)\ ^ (a t2 ~ tl $(v i) : a t2_tl 0(vi) 4>{wi) t 1 
?, h . 7T~ : X I ~ r ?; “ + — 


hizf 


hiziz 2 J 


a 


t2~tl 


hiz^ 


hiziz 2 


Vi4>(wi) a t2 ^Wi^vi) 


h\z\z 2 


h\ziZ2 


where $ and <f are the probability distribution function and the probability density function 
of a standard Gaussian random variable. 

Assume that we observe the process at M locations xi,... ,xm and N dates 1 1 ,..., tjy. 
Then, the spatio-temporal pairwise log-likelihood is defined by (see e.g. 

2013b) 

N- 1 N M—l M 

Lp (0) — 'y ^ y ^ y ^ y ^ log (f.xi){zi,kiZj.i, 0)), (20) 

i= 1 j=i +1 fc=l l=k+l 

where the uJ hJ and the uik,i are temporal and spatial weights, respectively, and z U)iri denotes 
the observation of the process at date n and site m. Then, the maximum pairwise 
likelihood estimator is given by 6 = argmax Lp ( 6 ). 

We will consider two different estimation schemes: 


Davis et al. 


- Scheme 1: As previously explained, due to Theorem [lj it is possible to separate 
the estimation of Q\ = (an, a 12 , a 22 )' and 0 2 = (a, r')' . The estimation of is 
carried out in a first step by maximizing the spatial pairwise log-likelihood (see 
). Once 9\ is known, we estimate 0 2 by maximizing Lp T (0 2 ) 

with respect to 0 2 . 


Padoan et al. 2010 


Scheme 2: We optimize L S p (0) with respect to 6, meaning that we estimate all 
parameters in a single step. 


As illustration of the above, we simulate 100 times the process (13) (where Z is the 
spatial Smith process) with parameter 6 = (1, 0,1, 0.7, —1, —1) at M sites and N dates. 
We compute statistical summaries from the 100 estimates obtained. In both schemes, 
we optimize Lf> T with u>ij = 1 and c Ok,i = 1 for all i = 1,..., N — 1, j = i + 1,..., N, 
k — 1,..., M — 1 and l — k + 1,, M. Tables [l] and [2] display the results for different 
values of M and N, in the cases of Scheme 1 and Scheme 2, respectively. 

For both schemes, the estimation is more accurate (the mean bias and the standard 
deviation decrease) as M and N increase. Moreover, we observe that the estimation of 
the spatio-temporal parameters a and r is satisfactory and clearly more accurate than 
that of the purely spatial parameters cr n ,(Ti 2 and cr 22 (the mean bias and the standard 
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True 

Pairwise likelihood (M=20, N=20) 


Pairwise likelihood (M=30, N=30) 


Mean estimate 

Mean bias 

Stdev 


Mean estimate 

Mean bias 

Stdev 

0-11 = 1 

1.139 

0.139 

0.421 


1.105 

0.105 

0.232 

0-12 = 0 

0.040 

0.040 

0.286 


- 0.024 

- 0.024 

0.162 

0- 2 2 = l 

1.185 

0.185 

0.325 


1.066 

0.066 

0.254 

P 

II 

o 

-4 

0.707 

0.007 

0.059 


0.701 

0.001 

0.026 

r i=-l 

- 0.990 

0.010 

0.123 


- 0.999 

0.001 

0.032 

T 2 =-l 

- 0.990 

0.010 

0.101 


- 0.998 

0.002 

0.043 


Table 1: Performance of the estimation in the case of Scheme 1. The mean estimate, the 
mean bias and the standard deviation are displayed. 


True 

Pairwise likelihood (M=20, N=20) 


Pairwise likelihood (M=30, N=30) 


Mean estimate 

Mean bias 

Stdev 


Mean estimate 

Mean bias 

Stdev 

o-n = l 

1.288 

0.288 

0.678 


1.239 

0.239 

0.483 

0-12 = 0 

0.043 

0.043 

0.621 


0.057 

0.057 

0.314 

0*22 = 1 

1.453 

0.453 

1.159 


1.264 

0.264 

0.574 

P 

II 

o 

0.706 

0.006 

0.050 


0.700 

0.000 

0.016 

Tl =-1 

- 0.998 

0.002 

0.115 


- 1.002 

- 0.002 

0.034 

T 2 =-l 

- 0.982 

0.018 

0.111 


- 1.003 

- 0.003 

0.035 


Table 2: Performance of the estimation in the case of Scheme 2. The mean estimate, the 
mean bias and the standard deviation are displayed. 


deviation are lower). Finally, the estimation of the purely spatial parameters is more 
accurate when using Scheme 1 (the mean bias and the standard deviation are lower). 
This stems probably from the fact that in Scheme 2, the number of pairs used is higher 
than in Scheme 1, introducing more variability. Indeed, contrary to what is assumed in 
the pairwise log-likelihood, the pairs considered are not independent. This dependence 
generates instability. For a discussion about the impact of the choice of pairs on esti¬ 
mation efficiency, see Padoan et al. (2010), p. 266 and 268. This finding shows that 
from a statistical point of view, spatio-temporal max-stable models that allow a separate 
estimation of the purely statistical parameters can be preferable; needless to say that a 
more extensive analysis would be needed at this point. 


5 Concluding remarks 

In order to overcome the defects of the spatio-temporal max-stable models introduced 
in the literature, we propose a class of models where we partly decouple the influence of 
time and space in the spectral representations. Time has an influence on space through 
a bijective operator in space. Then, we propose several sub-classes of models where our 
operator is either a translation or a rotation. An advantage of the class of models we 
propose lies in the fact that it allows the roles of time and space to be distinct. Especially, 
the stationary distributions in space can differ from the marginal distributions in time. 
Moreover, the space operator allows to account for physical processes. Our models have 
both a continuous-time and a discrete-time version. 

Then, we consider a special case of some of our models where the function related 
to time in the spectral representation is the exponential density (continuous-time case) 
or takes as values the probabilities of a geometric random variable (discrete-time case). 
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In this context, the corresponding models become Markovian and have a useful max- 
autoregressive representation. They appear as an extension to a spatial setting of the 
real-valued MARMA(1,0) process introduced by Davis and Resnick (1989). The main 
advantage of these models lies in the fact that the temporal dynamics are explicit and easy 
to interpret. Moreover, these processes are strongly mixing in time. We also show that 
the processes we introduce on the unit sphere of R 3 are geometrically ergodic. Finally, 
we briefly describe an inference method and show that it works well on simulated data, 
especially in the case of the parameters related to time. The detailed study of possible 
estimation methodologies for our class of models will be considered in a subsequent paper. 
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A Poisson point process on Z 


For A C Z, let 5(A) = ^(4 (^) = #AL where # stand for the cardinality of a set. 

The point process N (A) = J^kGA^k, A C Z, defines an homogeneous Poisson point 
process on Z with constant intensity density function equal to 1 since: 


• due to the additivity of the Poisson distribution, N (A) is Poisson distributed with 
parameter 5(A); 

• for any l > 1 and A 1; ..., Aj disjoint sets in Z, the N (Aj), i = 1 ,...,/, are inde¬ 
pendent random variables. 


B Proofs 

B.l For Theorem |T| 

Proof. For M G 1N\{0}, let t G Z, Xi,.. ., G R 2 and 2 i,..., Zm > 0, we deduce by Q 
and (j6j) that 


log(P(X(t,xi) • ,X(t,x M ) < z M )) 

M r V t (b)V R(t ^ m {c) 



Zn 


Hi(db) ii 2 {dc) 


Vr, 


it,bp 


(c) 


| fi 2 (dc )^j H!(db) 


P 2 (dc) / V t (b)ni(db) 

J E 1 

1*2 (dc). 


We now show that Assumption ([6]) is satisfied for models of types 1, 2, 3 and 4. 

For models of type 1, we have E 2 = R 2 , V R (t _ i)Xm (c) = f{R(t,b)*m ~ c) = /(x m - (c + (t - 
6 )r)) = Vx m (c+ (f — 6)r) and /x 2 = A 2 . Since A 2 is invariant under translation, we derive 
by a change of variable that 




Fx m (c+(t ~ b)r) 



H 2 (dc) 


For models of type 2, we have E 2 = S 2 and 




(t,bp 


[(c) f (R&(t—b),vXmi C, hi) 


hi 


47 T sinh k 


exp (hi(R^e( t -b),uc)'x) = F Xm (A-0p-b),uc) 
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and it follows, since /i 2 = Ag 2 is invariant under rotation, that 




Vx. m (R-o(t-b), u c) 



H 2 (dc) 


For models of type 3, we have 14(c) = c(x). Thus, if R(t,b)X- = x — tr, we have 
14 (tb) x m (c) = c(x m — tr). Thus, we deduce by stationarity that 




fj>2 (dc). 


If i?( tj b)X = Tx, we have Vr Xrn (c) = c(^4 t x m ). Hence, we obtain by isotropy that 




p 2 (dc). 


For models of type 4, we have 14(c) = c(x). Thus, if R( t ,b ) x = x — (t — 6 )r, we have 
14 (tb) x m (c) = c(x m — (f — 6 )r). Hence, we deduce by stationarity that 




^ 2 {dc). 


□ 


B.2 For Theorem [2] 

Proof. For M G 1N\{0}, H,..., G Z, x G R 2 and 2 i,..., Zm > 0, we have that 


- log (P(X(4,5 (tl) x) S(t M )x) < z M )) 


M 

/ v 

' Ei x E 2 m=1 


Vtm. (^) i6) 5( tm) X (c) 


Hi(db) fi 2 (dc) 


M 

f , V 

E1X.E2 m —l 


V,Jb)Vc ln Ac 


li\{db) n 2 (dc) 



V G{b)X (c)fi 2 (dc) 


Hi(db) 



Moreover, it is easy to show that Assumption (Jr]) is satisfied for models of types 1, 2, 3 
and 4 with the operators S( t \ that are given. □ 
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B.3 For Theorem [3] 

Proof, i) Let us consider the case X = R (the case X = Z is similar). We have that 


X(t, x) = \J {Uiuexp(-u(t - R i ))I {t _ Bi >o}Fx-(t-B i )r(C'i)} 

2— 1 
OO 

{UiV GXp( v{s ^~ t S -^2))-^{s+t_s_5i>0} (Ci) } 

2— 1 

OO 

= max (exp(—zv) lS \/ {[/^exp(-z/(t - 5 - ^))I {t _ s _ B .> 0 }V;_ ST _( t _ s _^)r(^))} , 

2=1 
OO 

\J {U{V exp(—z/(t - 5*))I{ t>Bi>t— s} U-W-^rtCi))} 

2=1 

= max ^exp(— u) s X(t — s,x — st), (1 — exp(— u) s )Z(t, x) j, 


where 


z (t, x ) = jrz — 1 \g\ V {FiZ/exp(-z/(t - R i ))I {t > Si>t _, } F x _ (t _ Bi)T (C' i ))} 

l 1 ex Pl v) ) i>i 


Since the sets {t > B > t — s} and {t — s > B} are disjoint, the Poisson point processes 
{(Ui,Bi,Ci),i : t > B t > t — s} and {(Ui, B b , Cf), i : t — s > B t } are independent and it 
follows that (X(t — s,x )) xeR 2 and (Z(t, x )) xeK 2 are also independent. 

We now show that 

OO 

Z{t,*)±\J {UiV^Ci)}, x G R 2 , 

2=1 

where (Ui, Cf)i>\ are the points of a Poisson point process on (0, oo) x E 2 of intensity 
vr 2 du x /i 2 (dc). Let (Ui,Bi,Cj) i>x be the points of a Poisson point process on (0, oo) x 
Rx£ 2 with intensity vr 2 du x db x /r 2 ( dc ). For M G 1N\{0}, let x 1; ..., x M G R 2 and 
z\,... , zm > 0. We consider the set 


Bz 1,-,Z M 

= {(«, b, c ) : uuexp(-u(t - b))I {t > b>t _ s} V Xm _( t _ b)T (c) > z m for at least one m = 1,..., M}. 
Denoting by /\ the min-operator, the Poisson measure of B Zu ^ Zm is 


A-{B Z -l,..., z M ) — 



' E 2 J H J 0 
M 


|“>Am=l| 


^exp(-^(4-6))I {t > 6>t _ s jV Xm _( t _ 6 )T(c) 


u ~du x db x /i 2 (dc) 


V {> db x lMdc) 

e 2 JU m=1 



= uexp(—ut) / I{ t > b>t -s} exp(ub) 

Jn 


r M 

/ v 

' E 2 m= 1 


Fc m —(t—6)-r( c ) 
%m. 


/i 2 (dc) ) db. 




M dc), 
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Since 








we deduce that 



It follows that 


log(P(Z(£,xi) < z 1} .. .,Z(t,x M ) < z M )) 


M 


^-(-S(l-exp(-i/) s )2i,...,(l-exp(-i/) s )2 M ) / \/ 

m =1 


K m (c) 


/Uo(dc). 


m 


ii) It is easily shown that the right-hand side of (12) is solution of (10). Moreover, as 

□ 


Davis and Resnick (1989), this solution is unique, yielding (12). 


B.4 For Proposition [T] 


Proof. For the sake of notational simplicity, we only give the proof in the case M = 3; this 
proof can easily be extended. Using the independence of the replications (Z(t, x )) xeK 2 
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and changes of indices, we obtain 


j 


j 


P[ \J {cd(l - a)Z(t i - j,xi -jr)} < z i, \/ {a J (l - a)Z(t 2 -j,x 2 ~ jr)} < z 2 , 
. j =o j=o 

J 


V M 1 - a )Z(t3 — j, x 3 — jr)} < 2:3 


= P 


j=0 

p ( V _ a ) z ( t i - i> x i - jr)} < *i, 

V i=o 
J ~\~t\ —^2 

V {a J+t2_tl (l - a)Z(ti - j,x 2 - (j + t 2 - ti)r} < z 2 , 

j=ti~t 2 

V {a J+t 3 _tl (l, - a)^(ti - j,x 3 - (j + t 3 - ti)r} < z 3 

j=ti-t 3 

V {a J (! - a)Z(ti -J, X 1 - jr)} < 21, 

j=o 

J+ii—*3 

V {a J+t2_tl (l -a)Z(ti - J,X 2 - (j + t 2 — ti)r} < z 2 , 

3 =0 

J+ii—*3 

V {a 3+t3 ~ tl (1 -a)Z(ti - J,X 3 - (j + t 3 — ti)r } < z 3 

j=0 / 

p( V {a J+t2_tl ( 1 - o)Z(ii - j,x 2 - (j + t 2 - ii)r)} < z 2 , 

V j=t\~t 2 

-1 

V {a 3+t3 ~ u (t - a)Z(ti - j,x 3 - (j + t 3 - ti)r)} < z 3 

j=tl-t2 

X P V {a J+t3_tl (l - a)Z(ti - J,X 3 - (j + t 3 ~ ti)r)} < z 3 

\j=ti-t 3 

xP V {a J (l-a)Z(t 1 — j,x 1 -jr)}<z 1 

\J+t\ — <t2 “I - 1 

( J +tl —^2 

V {a J (l — a)Z{ti — j, x i — jr)} < zi), 
j = J £3 + 1 

«/+tl — ^2 

V {cd +t2_tl ( 1 - a)Z(ti - j,x 2 - (j + t 2 - ti)r)} < z 2 

£3 + 1 


X 


( 21 ) 
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Using the independence of the replications (Z(f, x)) xe]R 2 , the stationarity of the processes 
(Z(t,x)) xg R 2 and the homogeneity of order —1 of V, we obtain 


P 


( J+t\—t 3 

V M 1 *a)Z{t-L - J,xi - jr)} < z H 

3 =0 

J+tl—ts 

V (! - a)Z(ti - j. x 2 - (j + t 2 -U)r)} < z 2 , 

j=o 

J+t\—t 3 

V {a J+t3_<1 (l — a)Z{ti ji x 3 (j + t3-U)r)} < £3 
j=o 
J+ti-t 3 


btl-ts / 

II p (^(*1 - h X 1 - J T ) < 


Z1 


3=0 


V 


od(l — a) 


, Z(ti - j, x 2 - (j + t 2 - U)r) < 


^2 


a i+«2-ii(l _ a )’ 


- j, x 3 - (j + t 3 - U)r < 


£3 


a t+t3-<l(l — a ) 


J+tl-t 3 

exp ( _ ~ , V Xl _j TiX 2 _(t 2 _t 1 ) T _j T)X 3_(i 3 _t 1 ) T _j T - 


i=o 


21 


^2 


^3 


aJ>( 1 — a) ’ a J a t2_tl (l — a) ’ a J ’a t3_<1 (l — a) 


^2 ^3 

a* 2_tl ’ a* 3_tl 

^2 ^3 


exp ^ V Xl ,x2-(t2-ii)r,X3-(t3-ti)r ^1? 

= exp ^-V xl)X 2 _( t 2 _ tl ) T)X 3 _( t 3_ tl ) T (zi, at2 _ tl , at3 _ tl 
A similar calculation yields 


J +tl —^3 


x (! - a ) Y1 


x (1 


a 


i=o 

j+ti—£ 3+1 


( 22 ) 


IP ( V I"' ' 1 '"il«| ir) | •£ 

d=tl~t2 
-1 

V {a J+t 3 -tl (l-a)Z(ii jj x 3 (j + t 3 -U)T)} < 


j=ti-t 2 


— exp ^ V X 2 )X 3 _( t 3_t 2 ) x ^ 


^2 ^3 


a t 2 -U ’ a i 3 -*i 


) x (1 - a* 2 "* 1 )) . 


Furthermore, we have that 
ai-t 2 -i 


P I V { aJ+t3 41 1 1 _ a)Z{t\ - J, x 3 - (j + t 3 - U)r)} < £3 

\j=ti-t 3 
ti-t 2 -i , 

= n p ( - j, x 3 -.(i+u - u)t) < 


j=tl-t 3 


= exp 


£3 


aPa * 3- * 1 (1 — a) 


a * 3 4l (l — a) 


= exp 


£3 

1 - a * 3- * 2 
^3 


ii — £2 — 1 

E - 

tl—*3 


(23) 


(24) 
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and similarly 


P 


V a J (l — a)Z(ti — j, xi — jr) < z x 

\J -\-ti —t 2 + l 


= exp 


a 


J+t\— *2+lM _ 0^2—tl^ 


Z1 


Finally, a similar calculation as in (22) yields 

( J -\~t\-t2 

V { flJ (l - a)Z{t\ -i,X! - jr)} < Zi, 

j—J +tl —^3 + 1 
J -Hi —^2 

\/ {a J+ * 2_tl (l -a)Z(ii - j,x 2 - (j + t 2 - h )r)} < z 2 

i— J +ti—t3+i 

= exp f-V, | X1+! ,) a' ,+ ‘‘"‘ s+1 (l - a 1 '"* 1 )) . 


(25) 


Inserting (22), (23), (24) and (25) in (21), we obtain, since linij _ s . 00 a J = 0, 

P(X(t 1 ; xi) < z u X(t 2 ,x 2 ) < z 2 ,X(t 3 ,x 3 ) < z 3 ) 

/ j J 

jirn P[ \J {a J (l - a)Z(t x - j,x. x - jr) <} z 1} \J {a J ( 1 - a)Z(t 2 - j, x 2 - jr)} < z 2 , 


<j=o 


j=o 


V M 1 “ a ) z ( t 3 - J,x 3 - jr)} < z 3 

3=0 

= exp ^ V Xl: x 2 _(t 2 _t 1 ) TjX 3_(t 3 _t 1 ) T ^i, 
x exp (— V X 2 , X 3 _(i 3 _t 2 ) 


z 2 Z 3 


a* 2- *! ’ a * 3- * 1 


**>»»-. 


□ 


B.5 For Proposition [2] 

Proof. Applying 0 with M — 2 and setting z x = z 2 = u for w > 0, we obtain 


P(X(t!,xi)) < u, X(t 2 , x 2 ) < it) = exp (-V Xl)X 2 _(t 2 _ tl)T [u, 


u 


t2~tl 


a 


exp 


a 


t2~tl 


u 


= exp 


FxLXa-te-tQr (1, Q* 1 + 1 ~ a t2 fl 

u 


yielding (15) by definition of the spatio-temporal extremal coefficient. 


In the same way as in the purely spatial case (see e.g. Cooley et al. 2006, p.379), it 
is easy to show the following link between the spatio-temporal < h 1 -madogram and the 
spatio-temporal extremal coefficient: 


u F {t 2 - ti,x 2 - Xi) 


1 d(t 2 - fi,x 2 - Xi) - 1 

2 0(t 2 -fi,x 2 - Xi) + 1 


1 1 

2 d(t 2 — fi, x 2 — x 3 ) + 1 ’ 


(26) 


Inserting (15) into (26) gives 


m 


22 


























B.6 For Theorem [4] 


Proof. We show that the two assumptions appearing in Theorem 3.6 in Hairer (2010) are 
satisfied. 

Assumption 1. There exists a function L : C $2 —¥ [0,oo) and constants K > 0 and 
7 G ( 0 , 1 ) such that 

E [L{X{t, -))| x(t -1,0 = K •)] < 7 L{h) + K 

for all h e C$ 2 . 

Let us choose L(h ) = llh 7 !^ with 0 < 7 < 1. 

First, note that the (U l ) l>l in (18) satisfy {Uj) l>x = 1 ) i>1 , where (-Pj)i>i are the points 

of an homogeneous Poisson point process on R + with constant intensity equal to one. 
Hence, the highest U\ corresponds to the smallest Pi , which is exponentially distributed 
with parameter 1. Hence, its inverse follows the standard Frechet distribution. Moreover, 
for / defined in (J5]) , ||/||oo is reached for x = pi and is finite. Therefore, the process Z 
defined in (18) satisfies 

Halloo = F 7 II/ 7 IL» (27) 

where Y is a random variable with standard Frechet distribution. Moreover, 


||max(ah(i? 0 iU x), (1 - a)Z(t,x)) 1 \\ oo = ||ma x(a r h 1 (R 9yU x), (1 - a) 7 Z 7 (t,x) 

< max (a 7 H/i 7 ^ , (1 - a ) 7 ||F 7 ||J. 

Using ((271) and ([28]), we obtain 


(28) 


E [L(X(t. -))|.Y(i — 1 , ■) — h(-)\ 

= E[||max(a 7 /i 7 (i?e lU x), (1 - a) 7 Z 7 (£,x))|| 0o ] 

<E[max(a 7 ||ft 7 || << ,,(l-or ||Z 7 ||J] 

= « 7 Halloo E (e 7 < (1 - ||/"|| ) + (7 - a ) 7 ll/llooE 

< « 7 Halloo + (l-a) 7 ||/|Ur(l- 7 °) 

= a?L{h) + (1 — a) 7 ||/|| oo r(l — 7 ), 


F 7 1 


{V7> 


^\\h"'l\ 00 

(1-a) 7 II/ 7 II oo 


yielding Assumption 1 . 

Assumption 2. We denote by ||.||tv the total variation distance between two probability 
measures. For every R > 0, there exists a constant a > 0 such that 


sup \\V hl - V h2 \\ TV < 2(1 - a), 

hifaGD 

where D = {hi, h 2 : L{hf) + L{h 2 ) < -R}, or equivalently 

sup |E [<p(X(t, -))| X(t - 1, 0 = MO] - E b(X(f, -))|X(f - 1, 0 = M0]l < 2(1 —ck). 


Using © , we have that 

|E [<p(X(t, 0)1 x(t -1,0 = hi{ 0] - E b(A(t, 0)|A(f - 1, 0 = M0] \ 

= |E [<^(max(a/ii (Re :U x) , (1 - a)Z(t,x))) - ^(max(ah 2 (-Mux), (1 - a)Z(t,x)))]| 

< E [|</?(max(ahi(i?6i, u x), (1 - a)Z(t,x ))) - ^(max(a/i 2 (fio,ux), (1 - a)Z(t,x)))|]. (29) 
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Moreover, for b,c, z G Cg 2 , we know that 


|</?(max(i)(x), z(x))) — <^(max(c(x), z(x)))| = 0 Vx G S 2 
if and only if max( 6 (x), c(x)) < z(x) Vx G S 2 . 

Therefore, for functions ip satisfying ||<^||oo < 1, 


E [|^(max(a/ii(i?e !U x), (1 




< 2 ^1 - P ^ Q | Z(t,x) > ^ Q _ ^ max(hi(f?@ iU x), h 2 (P 0iU x)) 


— 2 M— p | fl |z(tx)> 
V Vxes 2 ^ 


max(hi(x), h 2 (x)) 


a 

(1 - a) 
a 


max(hi(x), h 2 (x)) 


<2 ^1-P ^/\{Z(f,x)} > ymaxdl/iiHoo, Hhall^) 

The quantity /\ x(=s 2 {/(x; /x 1 , k)} is reached for x = ~ii 1 . Thus, 

OO 

Z(x) = V {^/( x ; Pn K )l > P r'/( x ; Mi, «) > A -1 A ^( x; Pi’ K )} 

i=l xes 2 


= p; 


It follows that 


P ( A {^( x )} ^ (1-a) max (H^iHoo> II^IL) 


^xeS 2 


> P Pi 


>-i 


K 


An sinh k 


exp (— k) > 


( 1 -a) 


maxdlhiHoo , ||h 2 ||oo) 


^ , „_i 47 T sinh Kd . . „ . 

> P [ P 1 1 > —-— exp (k) R ) , 


k(1 — a) 

noting that max(||hi|| , \\h 2 \\ ) < R. 


Therefore, combining (29), (30) and (31), we obtain 


(30) 


'i 1 -———exp (—k) . 
An smh k 


(31) 


sup |E [ v {X(t, -))| X(t - 1, •) = fti(.)] - E {<p(X(t, ■))\X(t-l,-) = h 2 (- 

hifo^D^tpW^Kl 

i ^ i i An sinh na , . \ \ 

< 2 f 1 - P f Pr 1 > —-- exp (k) R = 2(1 - a), 


denoting 


1 k( 1 — a) 

a = P ( Pf 1 > 


_i . An sinh na . . . 

^ —-- exp (k)R > 0 . 

k (1 — a) 


Hence, Assumption 2 holds. 


Finally, the application of Theorem 3.6 in Hairer (2010) yields the result. 


□ 
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B.7 For Proposition [3] 

Proof. We have that 


/(tl,xi),(t2,x 2 )(- z l) z 2 , &) 

_d__d 

dz\ dz 2 


exp -V XliX 2 _ (t 2 _t l)T [z u 


z 2 


7*2 ~tl 


1 - df 2 ^ 1 


“ d Zl ( 6XP ( ' Vxi > X2 " (ta - tl)T ( Zu 


1 - 


kt2~tl 


z 2 


1 d ( Z 2 

x ' dz 2 VxiiXa “ (t2 - tl)r w 1 ’ <P 2_tl 


1 — a 


t2~tl 


7 2 
*2 


= exp -V XliX 2 _p 2 _t l)T Zi, 


z 2 \ 1 - 0 t2 


a t2_tl 


— ^1 


* I ~ F XljX 2 _(7 2 _i 1 ) T ) + 


dzo 


a 


t2~tl 


z 2 

1 — a t2 il 


x 


d 


dz 


■V XliX 2 _(t 2 _ tl ) T (£ 1 , 


£2 


* 2—*1 


a 


, , ,, 1 „ ^2 \ 1 - 0‘ 2 4l \ ( d d ( z 2 

+ exp I V xliX 2 _(t 2 _ tl ) x (^l, ) ^ ) x ( a~ V xliX 2 _ (t 2 _i l)T Ui, 


a 


^2 


dz\ dz 2 


yielding the result. 


a t2_tl 

□ 


B.8 For Corollary [l] 

Proof. We denote by w = \+\ log ^ j and u = h—w , where h = (x 2 — xi)'S _ 1 (x 2 — X0 

From Padoan et al. (2010), p.275, we know that 


Ay r~ x = !A2+ALiA _A V rr = Al, ALAl and 

ft- Vxi,X2\ z h Z V 2 + U ~2 h7 7 ’ rxi,x 2 ^lj^2j 2 + 7.^2 L - ~ and 

C//6X HZ 1 ilZ\Z2 C/2-2 ^2 '^2 HZ\Z2 


< 9 2 v 0 (w) 

dz\dz2 Xl ’ Xzl 1J 2J h 2 ^ 2 


Hence, we obtain 




^2 ^ = 0(wQ _ q* 2 fl 0(ui) 


»,x 3 -(i 2 - t i) T pi, „ t2 _ tl ) .2 T ftjZ 2 


(32) 


d .. ( z 2 

d z 2 ^ 2 - {t2 - tl)T \ Zl, a^- tl) a t2 ~ 


1 /a 2 ^ 2 ^$(ui) a 2 ^ 2 q* 2 tl 0(in i i) \ 

!_tl \ ^2 + ^1^2 h\Z\Z2 ) 

(33) 


a *2 *l$( Vl ) a *2 *10( t , 1 ) <f{Wl) 

Z% hi z 2 h\Z\Z2 


and 


d 2 ,, 7 -2 \ 

'3^2 xl>xa “ (t2 " tl)T \ 1 ’ a t2_tl / 


1 /q * 2 *Wi 0 (w;i) q 2 ^ 2 tl ^i 0 (ni)\ 

a t2 ~ fl V h\z\z 2 + h\z x z\ ) 

Vi<f{wi) q* 2_tl Wi0(ni) 


fr?Z 2 Z 2 


tilz-tzl 


( 34 ) 
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Finally, we have that 


v ( y -2 \ _ 

v xl , X2 -( t2 - tl)T i, at2 _ tl ) zi + 


$(wi) a t2 (l $(ni) 


^2 


Inserting (32), (33), (34) and (35) in (19), we obtain the result 


(35) 

□ 
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